International Journal of Engineering Technology and Management Sciences 
Website: ijetms.in Issue: 2 Volume No.7 March - April — 2023 
DOI:10.46647/ijetms.2023.v07i02.077 ISSN: 2581-4621 


A NOVEL SOFTWARE ENGINEERING APPROACH TOWARD 
USING MACHINE LEARNING FOR IMPROVING THE 
EFFICIENCY OF HEALTH SYSTEMS 


S.Shanmugapriya!, Ms. P.Devika ME? 
1PG Scholar, Department of Computer Science and Engineering, Nandha Engineering 
College(Autonomous), Erode, Tamilnadu, India 
Assistant Professor Department of Computer Science and Engineering, Nandha Engineering 
College(Autonomous), Erode, Tamilnadu, India2 


ABSTRACT 

This study examines how software engineering and machine learning interact in the context of health 
systems. We proposed the software framework and methodology as a fresh framework for health 
informatics. Health informatics engineering for machine learning (SEMLHI). The SEMLHI 
framework consists of four modules (software, machine learning, machine learning algorithms, and 
health informatics data), which group the tasks in the framework according to the SEMLHI 
methodology. This enables researchers and developers to examine health informatics software from 
an engineering standpoint and gives developers a new road map for creating health applications with 
system functions and software implementations. In order to comprehend both the function of objects 
linked with the system and the machine learning techniques that must be used on the dataset, users 
can study and analyze user demands with the help of our new technique, which sheds light on its 
qualities. Real data from a hospital run by the Palestinian Authority during the last three years make 
up our dataset for this study. The SEMLHI technique is broken down into seven phases: creating, 
managing, defining, and implementing procedures; gathering data; ensuring security and privacy; 
testing and evaluating performance; and delivering software applications. 
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1. INTRODUCTION 

The area of health informatics (HI) seeks to connect different ideas on a big scale. Typically, a 
healthcare dataset is discovered to be inadequate and noisy; as a result, reading data from dataset 
linkage typically fails within the first few minutes. field of software engineering. Because it can store 
data on a massive scale, machine learning (ML) is a fast expanding discipline of computer science. 
Many ML technologies may be used to evaluate data and generate information that can enhance the 
quality of work for both staff and doctors; however, there is presently no technique that can be utilized 
for developers. There has been a shortage of techniques to assessing which software engineering 
activities are best completed by automation and which require human involvement or human-in-the- 
loop approaches in the past. 

1.1 HEALTH DATASET ANALYSIS 

Health care analytics is a subset of data analytics that employs both historical and present data to 
provide actionable insights, enhance decision-making, and maximize results in the health care 
business. Health care analytics is utilized to benefit not just health care companies, but also to improve 
patient experience and health outcomes. The health-care business is brimming with useful 
information in the form of precise records. Many of these documents must be kept for a specific 
amount of time, according to industry standards. As a result, individuals working with "big data," or 
vast pools of unstructured data, have taken an interest in health care. Big data analytics in health care, 
as a still-developing discipline, has the potential to cut operational costs, enhance efficiency, and treat 
patients. 

1.2 MACHINE LEARNING 

Machine learning (ML) is a subset of artificial intelligence (AI) that enables software programmes to 
grow increasingly effective at predicting outcomes without explicitly programming them to do so. 
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Machine learning algorithms estimate new output values by using past data as input. Machine learning 
is commonly used in recommendation engines. Fraud detection, spam filtering, malware threat 
detection, business process automation (BPA), and predictive maintenance are all prominent 
applications. Machine learning is fundamental to the operations of many of today's biggest 
organizations, like Face book, Google, and Uber. For many businesses, machine learning has become 
a crucial competitive differentiation. Classical machine learning is frequently classified by how an 
algorithm learns to improve its prediction accuracy. There are four fundamental techniques to 
learning: supervised learning, unsupervised learning, semi-supervised learning, and reinforcement 
learning. The algorithm that data scientists employ is determined on the sort of data they wish to 
forecast. 

1.3 SOFTWARE ENGINEERING 

Software engineering often begins with the first step as a user-request initiation for a specific job or 
an output. He sends his request to a service provider company. The software development team 
divides requirements into three categories: user requirements, system requirements, and functional 
requirements. The requirements are gathered by conducting user interviews, consulting a database, 
researching the existing system, and so on. Following requirement collecting, the team determines if 
the programme can be designed to meet all of the user's expectations. The developer then creates a 
roadmap for his strategy. Understanding the constraints of software products is also part of system 
analysis. 


2. LITERATURE SURVEY 

2.1 A METHOD FOR EVALUATING AND DEFENDING PRACTICAL SIGNIFICANCE IN 
SOFTWARE ENGINEERING 

In this study, Richard Torkar et al. suggest the assessment of practical importance, which answers the 
issue, is a primary objective of empirical research in software engineering. if the reported effects of 
certain comparable therapies demonstrate a meaningful difference in practice in actual settings 
Despite the fact that There are several common ways for assessing statistical significance; however, 
linking it to practical importance is neither simple or usual; rather, just a few empirical studies in 
software engineering measure practical significance in a defined and systematic manner. Way In this 
study, we show that Bayesian data analysis provides appropriate techniques for systematically 
assessing practical importance. In a case study comparing various test procedures, we show our 
assertions. The data from the case study was previously evaluated (Afzal et al., 2015) using standard 
statistical significance approaches. We create a multilevel model of the same data and fit and evaluate 
it using Bayesian approaches. To quantitatively tie our statistical analysis result to a realistically 
useful context, we utilize cumulative prospect theory on top of the statistical model. This is then used 
to analyze and argue for practical importance. Our research shows that Bayesian analysis can provide 
a theoretically sound yet practical framework for empirical software engineering. A significant side 
consequence is that any uncertainty in the underlying data is communicated via the statistical model, 
revealing its implications for practical importance. Thus, when combined with cumulative prospect 
theory, Bayesian analysis facilitates effortlessly analyzing practical importance in an empirical 
software engineering setting, possibly clarifying and expanding research relevance for practitioners. 
2.2 ANALYZING GROUPS OF SOFTWARE ENGINEERING REPLICATIONS: A 
PROCEDURE AND GUIDELINES 

Adrian Santos et al. argued in their work that researchers from various groups and institutions are 
collaborating on constructing groups of studies through replication (i.e., conducting groups of 
replications). Diverse aggregation approaches are being used to evaluate groupings of data. 
Replications. The use of inappropriate approaches to aggregate replication findings may jeopardize 
groups of replications’ ability to give in-depth insights from experiment outcomes. Objectives: To 
aggregate software engineering (SE) replication data, provide an analysis mechanism with a set of 
integrated criteria. Method: We compare the features of replication groups in SE to those in other 
mature experimental fields like medicine and pharmacology. Given their differences, the limitations 
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of joint data analysis of groups of SE replications, and the guidelines provided in mature experimental 
disciplines to analyze groups of replications, we developed an analysis procedure with a set of 
embedded guidelines tailored specifically to the analysis of groups of SE replications. To demonstrate 
its use, we apply the suggested analytic approach to a representative collection of SE replications. 
Results: During the aggregate of replication results, all of the information included within the raw 
data should be utilized. The proposed analytic technique supports the use of stratified individual 
participant data as well as aggregated data in tandem to examine groups of SE replications. 
Conclusion: In research publications, the aggregation approaches used to examine groups of 
replications should be justified. This improves the dependability and openness of collaborative 
outcomes. The suggested guidelines should make this task easier. 

2.3 A PROGRESSION MODEL OF STARTUP SOFTWARE ENGINEERING GOALS, 
CHALLENGES, AND PRACTICES 

In this research, Eriks Klotins et al. claim that software start-ups are emerging as providers of 
innovation and software-intensive products. Traditional software engineering approaches, on the 
other hand, are not reviewed in the context of start-up goals and problems. As a result of this, In the 
startup setting, there is insufficient support for software engineering. The purpose is to collect data 
on engineering goals, difficulties, and practices in start-up organizations in order to identify trends 
and patterns that characterise engineering work in start-ups. This type of data helps academics to 
better understand how objectives and obstacles connect to practices. This information may then be 
used to influence future research targeted at developing solutions to those aims and obstacles. 
Furthermore, these trends and patterns might help practitioners make better judgments in their 
engineering practice. Technique: We collect firsthand, in-depth experiences from a broad sample of 
software start-ups using a case survey method. We explain and discover trends using open coding 
and cross-case analysis, and we back up our conclusions using statistical analysis. Results: We 
examined 84 start-up examples and identified 16 common aims, 9 difficulties, and 16 engineering 
methods. These objectives, difficulties, and techniques have been assigned to stages of the startup life 
cycle (inception, stabilization, growth, and maturity). As a result, the progression model driving 
software engineering efforts in start-ups was created. 

2.4 A VALUE THEORY FOR FEATURE SELECTION BASED ON VALUE IN SOFTWARE 
ENGINEERING 

In this study, Pilar Rodrguez et al. offer Value-Based Software Engineering, which emphasizes the 
relevance of value in software-related choices. In the context of feature selection, software features 
deemed to be more valuable receive precedence in the development process. This study is concerned 
with what When selecting software features, value signifies We conducted and evaluated semi- 
structured interviews with 21 important stakeholders (decision-makers) from three 
software/software-intensive organizations using grounded theory, in an environment where value- 
based decision-making was already established. Our research resulted in the development of a value- 
based feature selection theory that defines the type of value propositions evaluated by key 
stakeholders when picking software features (i.e., decision-making criteria for deciding on software 
features, as proposed by Boehm (2003)). We discovered that some value propositions (core value 
propositions) were shared by all three firm examples, whilst others were depending on the context in 
which a company works and the features of the product under development (specific value 
propositions). Furthermore, value propositions differ depending on the stakeholder group and the type 
of feature being evaluated. Our research sheds light on value in the context of feature selection and 
develops new ideas for value-based feature selection, such as new value propositions. 


2.5 EMPIRICAL SOFTWARE ENGINEERING RESEARCH USING BAYESIAN DATA 
ANALYSIS 

Statistics, according to Carlo A. Furia et al., come in two flavors: frequents and Bayesian. Frequents 
statistics have long dominated empirical data analysis for historical and technological reasons, and 
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they undoubtedly remain dominant in empirical software engineering. This The situation is 
unfortunate because frequentist statistics have a number of flaws, such as a lack of flexibility and 
results that are unintuitive and difficult to interpret, that limit their effectiveness when dealing with 
the heterogeneous data that is becoming increasingly available for empirical analysis of software 
engineering practice. In this study, we identify these flaws and provide Bayesian data analysis 
strategies that give practical benefits, such as clearer answers that are both resilient and nuanced. 
Following a brief overview of the fundamental techniques of Bayesian statistics, we give a reanalysis 
of two empirical studies on the usefulness of automatically generated tests and the performance of 
programming languages. We illustrate the tangible advantages of the latter by contrasting the original 
frequentist analysis with our new Bayesian analyses. Finally, we argue for a greater emphasis on 
Bayesian statistical approaches in empirical software engineering research and practice. 

2.6 EVALUATING A REMOTELY CONDUCTED EDUCATIONAL ESCAPE ROOM FOR 
TEACHING SOFTWARE ENGINEERING 

In this study, ALDO GORDILLO et al. offer with the rise of distant learning, instructors have faced 
new obstacles. Developing effective and inspiring group activities for students in the remote 
classroom is one of these problems. one of the highest concerns to be addressed When done face-to- 
face, educational escape rooms have proved to be interesting and effective learning exercises, 
according to available literature. However, no prior research has examined the educational 
effectiveness of these activities when carried out remotely. Furthermore, none of the reported 
educational escape rooms in the literature were meant to teach software modeling. This article 
examines a remote educational escape room used to teach software modeling in a software 
engineering fundamentals course. A pre-test and post-test to assess students’ learning gains, a 
questionnaire to collect students’ impressions, and a web platform to automatically capture data on 
students’ interactions were all employed. This article makes two contributions. On the one hand, it 
provides proof that remote educational escape rooms may be successful learning activities for the 
first time. On the other hand, for the first time, it demonstrates that educational escape rooms are 
excellent and interesting activities for teaching software modeling. 

2.7 CRITICAL THINKING ASSESSMENT IN ONLINE SOFTWARE ENGINEERING 
EDUCATION: A SYSTEMATIC MAPPING STUDY 

In his study, OSCAR ANCN BASTAS et al. stated that critical thinking consisted in analyzing and 
assessing the coherence of argument. This competence is critical when discussing software quality 
(SQ). SQ is strongly tied to the capacity of the engineer to Students must accurately appraise and 
differentiate amongst answers, thus they must analyze, evaluate, and form conclusions. As a result, 
critical thinking becomes an essential component of software engineer training. This paper describes 
a systematic mapping study (SMS) whose goal was to discover, organize, and characterise key 
features in online critical thinking teaching-learning for software engineers. Based on the SMS 
results, we present a preliminary framework for assessing critical thinking in software engineering 
instruction in the context of online higher education. This idea is anticipated to serve as a foundation 
for discipline teachers when evaluating critical thinking in the context of online instruction. 

2.8 GAMIFYING SOFTWARE ENGINEERING TOOLS TO ENCOURAGE COMPUTER 
SCIENCE STUDENTS TO BEGIN AND COMPLETE PROGRAMMING ASSIGNMENTS 
EARLIER 

In this work, Madison W et al. suggest Contribution: According to research, computer science (CS) 
students who begin programming tasks do better (PAs) Early finishers typically obtain higher ratings. 
This paper describes and examines a gamification method that uses software engineering tools to 
push CS students to begin and complete PAs sooner. Background: Computer science may be 
challenging to master since students frequently struggle with mistakes and how to properly test their 
programmes. For these reasons, it is critical that students begin their PAs as soon as possible. In 
addition, software engineering technologies like version control and unit testing are becoming 
increasingly crucial for students to understand early in their careers. Intended Outcomes: This 
gamification method intends to motivate CS students to begin and complete PAs early, as well as to 
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teach excellent practices in software engineering. Application Development: An open-source 
gasification system called the Leader board was created to encourage students to start and finish 
projects on time. The Leader board awards students who complete PA unit assessments ahead of time 
with gamified points. Github Classroom, a build server, and the Model learning management system 
are used to fully automate the system. Findings: Students who utilized the Leader board did not begin 
assignments considerably faster; nevertheless, they completed assignments earlier, committed code 
more frequently, and passed more unit tests. The Leader board was encouraging for the kids, and 
completing unit assessments was thrilling for them. 

2.9 ACTIVE METHODOLOGIES SUPPORTED BY THE VIRTUAL CAMPUS FOR 
IMPROVING LEARNING OUTCOMES IN SOFTWARE ENGINEERING 

In this research, Alicia Garca-Holgado et al. claimed that there are some subjects that are perceived 
as a challenge. Engineering, where the uniqueness of its concepts and the necessity to apply abstract 
thinking necessitate significant effort to comprehend new paradigms of information system design 
and development. Because of these challenges, pupils approach the topic with little enthusiasm. The 
current work presents a succession of teaching approach improvements implemented during the 
previous three academic years, with a focus on the implementation in 2017-18. By comparing the 
results achieved by students studying active methodology in 2017-18 with the results received by 
students studying active methodology in 2013-14, the results demonstrate a good link between the 
students' engagement and the subject's success ratio. 

2.10 IMPROVING STUDENT ENGAGEMENT THROUGH PROJECT-BASED 
LEARNING: A SOFTWARE ENGINEERING CASE STUDY 

In this study, Paula Morais et al. offer In the field of information and communication technologies, 
aside from the issue of involvement, students Learning modeling and programming concepts is 
frequently tough. The causes for these issues are widely recognised and detailed in the literature, and 
they hint to problems with abstraction and logic thinking. This article covers an experiment that was 
carried out in order to improve the learning experiences of students enrolled in the Computer Science 
bachelor's degree course, who attended three curricular units: Information Systems Development, 
Data Structures, and Web Languages and Technologies. Teachers employed project-based learning 
as an active learning paradigm in their approach. 

2.11 THE ASEST+ FRAMEWORK FOR IMPROVING TEAMWORK IN AGILE 
SOFTWARE ENGINEERING EDUCATION 

In this study, Daymy Tamayo et al. suggest this article describes an upgraded version of agile software 
engineers stay together (ASEST+). ASEST is a framework that tries to improve team cohesiveness, 
resulting in better team learning and software engineering student teams. Background: Effective 
cooperation is critical for the success of agile software development and is thus an important topic in 
contemporary software engineering education. A tentative proposal for ASEST+ was offered in the 
prior work. An enhanced version, more appropriate for agile practice teaching and taking coherence 
antecedents into account, is detailed here. The desired outcome is: A teamwork-supporting teaching- 
learning framework for agile software education. ASEST+ is based on Scrum teams and combines 
learning methodologies to teach students collaborative and technical agile processes. ASEST+ 
develops policies for role assignment and team rule agreements in order to govern communication 
and address conflict management agile methods. ASEST+ covers the most significant antecedents, 
which are personality attributes, conflict resolution, and task interdependence. Findings: When 
compared to the control group, using ASEST+ significantly boosts students' favorable evaluations of 
team cohesiveness, team performance, and team learning. 


2.12 A NOVEL SOFTWARE ENGINEERING APPROACH TOWARD USING MACHINE 
LEARNING FOR IMPROVING THE EFFICIENCY OF HEALTH SYSTEMS 

In this study, MOHAMMED MOREB et al. suggest Machine learning has recently been a popular 
academic area. As a result, this research looks on the interplay between software engineering and 
machine learning in the context of health systems. We suggested a unique framework for health 
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informatics: the software engineering framework and methodology for machine learning in health 
informatics (SEMLHI). The SEMLHI framework consists of four modules (software, machine 
learning, machine learning algorithms, and health informatics data) that organize the framework's 
tasks using a SEMLHI methodology, allowing researchers and developers to analyze health 
informatics software from an engineering perspective and providing developers with a new road map 
for designing health applications with system functions and software implementations. Our new 
technique illuminates its properties and enables users to study and evaluate user needs in order to 
understand the role of objects associated to the system as well as the machine learning algorithms that 
must be applied to the dataset. Our dataset for this study is genuine data that was initially obtained 
from a hospital managed by the Palestinian Authority during the previous three years. The SEMLHI 
technique is divided into seven stages: developing, implementing, managing, and defining processes; 
organizing information; assuring security and privacy; performance testing and assessment; and 
software application release. 

2.13 AN EMPIRICAL INVESTIGATION OF NONVERBAL COMMUNICATION IN 
SOFTWARE ENGINEERING 

PAOLO CIANCARINI et al., has proposed in this paper Correspondence among people comprises 
of both verbal and non verbal parts. The latter may occasionally convey concepts or ideas that the 
former cannot. Software engineering is no different. Using distributed cognition as a conceptual 
palette and reference, this paper first conducts a theoretical analysis of the role of nonverbal 
communication in software development teams. After that, it presents the results of an empirical study 
involving 38 Russian IT professionals who discussed their experiences with communicating and 
interacting while creating software artifacts. A distributed approach to cognition is supported in many 
ways by the findings of this empirical investigation. In addition, our findings establish a new 
framework for future research and provide valuable insights into how software development teams 
can improve communication. 

2.14 ON USING GREY LITERATURE AND GOOGLE SCHOLAR IN SYSTEMATIC 
LITERATURE REVIEWS IN SOFTWARE ENGINEERING 

In this study, AFFAN YASIN et al. offer the inclusion of grey literature (GL) is critical for removing 
publication bias. Collecting available evidence on a certain problem The number of systematic 
reviews of the literature (SLRs) The use of GL in Software Engineering (SE) is growing, but we don't 
know how widespread it is in these SLRs. Furthermore, while Google Scholar is quickly becoming 
the search engine of choice for many scholars, the extent to which it can discover primary studies is 
unknown. This tertiary education is an effort to I assess the use of GL in SLRs in SE Furthermore, 
this study provides ways for classifying GL and a quality checklist to utilize for GL in future SLRs; 
ii) investigate whether it is possible to identify scholarly publications for academic research using 
just Google Scholar. 

2.15 A SYSTEMATIC MAPPING STUDY OF SECURITY IN TELEHEALTH SYSTEMS 
FROM THE PERSPECTIVE OF SOFTWARE ENGINEERING 

In this study, GASTN MRQUEZ et al. propose that Telehealth systems provide remote treatment to 
elderly and physically disabled patients. as well as remote operations, treatments, and diagnostics In 
this context, various systemic features (such as security) must be met in order for Telehealth systems 
to work. Although previous studies describe many security incidents using Telehealth systems, it is 
difficult to have a clear perspective on which security vulnerabilities have been documented and 
which remedies have been provided. Furthermore, because Telehealth systems are made up of several 
software systems, it is unclear whether important aspects of Software Engineering are crucial for 
developing safe Telehealth systems. This paper describes a systematic mapping study (SMS) aimed 
at detecting, organizing, and characterizing security problems in Telehealth systems. We identified 
and categorized 41 primary studies from over a thousand papers. The findings show that I four 
security classifications (attacks, vulnerabilities, weaknesses, and threats) focus on the most frequently 
reported security issues; (ii) three security strategies (detect attacks, stop or mitigate attacks, and react 
to attacks) characterise security issues; and (iii) the most relevant research themes are related to 
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insecure data transmission and privacy. According to the SMS results, software design, requirements, 
and models are critical areas for developing safe Telehealth systems. 

2.16 SOFTWARE DEVELOPMENT FOR IOT-POWERED DATA ANALYTICS 
APPLICATIONS 

In this research, AAKASH AHMAD et al. claim that Internet of Things Driven Data Analytics (loT- 
DA) has the potential to excel data-driven operationalisation of smart settings. However, there has 
been little study on how IoT-DA applications work. within the framework of the software and system 
engineering life-cycle, are planned, implemented, operational zed, and evolved This paper 
experimentally develops a methodology that can be used to systematically explore the influence of 
software engineering (SE) processes and their underlying practices in the development of IoT-DA 
applications. First, we provide an assessment framework based on current frameworks and 
taxonomies to evaluate SE software processes, techniques, and other artifacts for loT-DA. Second, 
we conduct a systematic mapping analysis to pick 16 SE processes (from academic research and 
industrial solutions) for loT-DA. Third, we use our created evaluation framework, which is based on 
17 separate criteria (also known as process activities), to conduct fine-grained investigations of each 
of the 16 SE processes. Fourth, we illustrate the construction of an IoT-DA healthcare application 
using our suggested framework in a case study. Finally, based on the framework's support for process- 
centric software engineering of IoT-DA, we highlight important obstacles, recommended practices, 
and lessons learned. The findings of this study can help academics and practitioners design 
developing and next-generation IoT-DA software solutions. 

2.17 SECURITY APPROACHES IN SECURE SOFTWARE ENGINEERING: A 
SYSTEMATIC MAPPING STUDY 

In this study, RAFIQ AHMAD KHAN et al. suggest Software systems have been significantly 
changed and have become an indispensable component of human society in the current digital era. 
Such widespread usage of software systems involves vast amounts of essential data. that must 
unavoidably be safeguarded It is critical to ensure that these software systems not only meet the 
demands or functional requirements of the users, but also that they are secure. However, current 
research indicates that many software development processes do not explicitly integrate software 
security protections as they progress from demand engineering to their ultimate losses. Integrating 
software security into all stages of the software development life cycle (SDLC) has become critical. 
Various strategies, techniques, and models for dealing with software security have been proposed and 
developed; yet, only a handful of them give good evidence for developing safe software applications. 
The primary goal of this research is to investigate security measures in the context of secure software 
development (SSD) during the study of systematic mapping (SMS). 116 studies were chosen based 
on the inclusion and exclusion criteria. Following the extraction of data from the 116 articles, they 
were categorized based on the quality evaluation, software security approach, SDLC phases, 
publishing venue, and SWOT analysis. 

2.18 A SYSTEMATIC MAPPING STUDY OF TEAM FORMATION IN SOFTWARE 
ENGINEERING 

In this study, ALEXANDRE COSTA et al. suggest that software team building is an important project 
management activity. However, For most businesses, building proper teams is a difficulty. The goal 
of this research is to assess and synthesize the state of the art in software team formation research. 
Furthermore, we intend to create taxonomy out of the discovered body of knowledge in software team 
building. Method: Out of 2516 main papers, 51 were chosen and evaluated using a Snowballing- 
based systematic mapping research. We categorized the studies based on the research methodologies 
employed, the overall quality of the investigations, as well as the features of the created teams and 
the offered solutions. The bulk of the research employs search and optimization techniques in their 
methods. Furthermore, technical features are the most commonly used to develop individual profiles 
throughout the team creation process. In addition, we suggested taxonomy for software team building. 
Conclusion: Search-based methods that mix search and optimization techniques with technical 
features are widely used. However, there is a trend toward using non-technical features as 
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supplementary information. Concerning research gaps, we stress the extent of subjectivity in software 
team creation as well as the offered solutions' lack of scalability. 

2.19 THE INFLUENCE OF LEXICON ADAPTATION ON EMOTION MINING USING 
SOFTWARE ENGINEERING ARTIFACTS 

Sentiment analysis and emotion mining approaches are rapidly being employed in the field of 
software engineering, according to MICHAL R et al. However, previous trials have not produced 
good accuracy findings. The fundamental reason of this predicament, according to researchers, is a 
lack of adaptation of emotion mining algorithms to the unique context of the domain. The article 
discusses study aimed at determining if adapting the lexicon with emotional intensity of words in the 
context of software engineering enhances sentiment analysis reliability. A new vocabulary is created 
for this purpose, in which terms are assessed as though they were employed in the field of software 
engineering. A comparison experiment using emotion mining based on a generic and a software 
engineering specialized vocabulary yields no notable variations in findings. 

2.20 THE INFLUENCE OF LEXICON ADAPTATION ON EMOTION MINING USING 
SOFTWARE ENGINEERING ARTIFACTS 

Sentiment analysis and emotion mining approaches are rapidly being employed in the field of 
software engineering, according to MICHAL R et al. However, previous trials have not produced 
good accuracy findings. The fundamental reason of this predicament, according to researchers, is a 
lack of adaptation of emotion mining algorithms to the unique context of the domain. The article 
discusses study aimed at determining if adapting the lexicon with emotional intensity of words in the 
context of software engineering enhances sentiment analysis reliability. A new vocabulary is created 
for this purpose, in which terms are assessed as though they were employed in the field of software 
engineering. A comparison experiment using emotion mining based on a generic and a software 
engineering specialized vocabulary yields no notable variations in findings. 

2.21 MOVING DISCRETE MATHEMATICS AND SOFTWARE ENGINEERING 
TOGETHER 

In this study, Wanwei Liu et al. claim that students of software engineering should be trained in 
software development. Early in the curriculum, students are introduced to software development 
activities. This requires addressing the difficulty of involving students in software development prior 
to enrolling in the software engineering course. We suggest a method in this study. strategy for 
connecting courses in the software engineering curriculum by assigning complete development 
projects to students in software development preparatory courses We discuss the execution of the 
suggested technique and teaching methods utilizing various practical and thorough projects derived 
from discrete mathematics themes, using the Discrete Mathematics (DM) course as an example. There 
are detailed explanations of the sample projects, their applications, and training results. The outcomes 
and lessons learnt from implementing these practices indicate that it is a promising method of 
connecting courses in the software engineering curriculum. 

2.22 IMPROVED ENERGY-USE MULTI-SENSOR OBJECT DETECTION IN WIRELESS 
SENSOR NETWORKS DANIYAL 

Alghazzawi et al. suggest that independent sensor networks capable of sensing physical parameters 
like temperature, pressure, and humidity are distributed geographically within Wireless Sensor 
Networks (WSNs). Examples include energy, pressure, and sound. WSNs are resilient and have a 
secure connection to the physical environment. Data aggregation (DA) is an important part of WSN. 
helps cut down on energy use (EC). Existing research efforts have discovered DA with a high 
aggregation rate for WSNs in order to have reliable data. centered on DRINA (In-Network 
Aggregation Data Routing). Nevertheless, there is none. achieving an effective balance between 
routing and overhead; however, the EC DA requirements remained unmet. The Bayes Node's 
detection of things places the same event in specific locations. nodes of sensors (SNs). For effective 
DA at the sink in a heterogeneous environment, the Scheduling Multi-Sensor Data Synchronization 
(MSDSS) framework is proposed. Secure and energy-efficient In-Network Aggregation Sensor Data 
Routing (SEE-INASDR) is developed using a sensor network based on dynamic routing. (DR) 
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structure in WSNs to ensure the security of data transfers. The Polynomial Distribution (BNEPD) 
method decreased the Energy Drain Rate (EDR), and the poly distribution technique decreased the 
Communication Overhead (CO) by 39%, as demonstrated by our experimental results. The Network 
was also improved by the MSDSS structure that was planned. Lifetime (NL) is cut 15% shorter. 
Additionally, Data Aggregation Routing was improved by 10.5% with this framework. DAR). Last 
but not least, the SEE-INASDR architecture saved a lot of money. Utilizing a secure and energy- 
efficient routing protocol (SEERP) results in a 51 percent reduction in EC. 

2.23 TASK SCHEDULING IN THE CLOUD BASED ON TWO STAGES DYNAMIC 
ALGORITHM STRATEGY 

M.Deepika et al. presented in this study to improve task allocating performance and reduce illogical 
task allocation. Allocations in a cloud environment, this research proposes a two-stage technique. 
Strategy. Initially, a job classifier is driven by the design of a Naive Bayes classifier. The approach 
is used to classify occupations based on past scheduling data. Certain A number of virtual machines 
(VMs) of different sorts are built in response. This saves time. the time it takes to generate virtual 
machines during task allocation Jobs will be available in the next step. are dynamically coordinated 
with solid virtual machines Several dynamic algorithms are recommended for work allocation, 
accordingly. The exploratory finding demonstrates that effectively boosts the cloud's task allocation 
performance and complete the load In compared to conventional solutions, the splints of cloud 
resources. 

2.24 AN INVESTIGATION OF A ROUTING APPROACH FOR IN-NETWORK 
AGGREGATION IN WIRELESS SENSOR NETWORKS 

In this paper, SSSUDHA et al. propose that we may construct data aggregation utilising the Data 
aggregation and routing techniques can help to lower the cost of Wireless sensor network 
communication Traffic congestion occurs when one or more of the many sensor nodes detects events. 
The network should inform the occurrence to save electricity. Only when an event occurs, 
appropriately. Overhead happens in Because of its poor scalability, INFRA. According to the 
projected The DRINA algorithm (Data Routing for In-Network Aggregation) decreases 
communication costs and conserves energy By constructing the routing tree, we optimised the 
reducing the amount of duplicate routes and removing the superfluous data. The DRINA's 
performance has been compared to three others. additional protocols known: the Information Fusion- 
based Role Algorithms for Assignment (InFRA), Shortest Path Tree (SPT), and The algorithm of 
cantered-at-nearest-source (CNS). 

2.25 POLYNOMIAL DISTRIBUTION OF BAYES NODE ENERGY TO IMPROVE 
WIRELESS SENSOR ROUTING NETWORK 

In this paper, Karthikeyan et al. propose a Wireless Sensor Network to monitor and manage the 
physical environment using a huge number of tiny sensors. low-cost sensor nodes Existing Wireless 
Sensor Network (WSN) technique given increased latency due to sensed data transfer via continuous 
data gathering as well as energy consumption To solve the routing problem and decrease energy 
consumption, The Bayes Node Energy and Polynomial Distribution (BNEPD) approach is presented. 
In a wireless sensor network, energy-aware routing is used. Energy Distribution at the Bayes Node 
first distributes sensor nodes that detect a same event (1.e., temperature, etc.) The Bayes rule is used 
to direct pressure and flow into specified locations. The detection of objects comparable events is 
completed and delivered to the sink based on the Bayes probability As a consequence, energy usage 
is reduced. The Polynomial Regression follows. The function is applied to the target object of 
comparable events evaluated for various sensors. Combined. They are calculated using the lowest 
and maximum values of object events. Shifted to the sink node finally, the Poly Distribute method 
distributes the data properly. Nodes of sensing 


3. COMPARATIVE ANALYSIS 
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Title Techniques & | Parameter Analysis Future Work 
Mechanisms 

A Method to Assess and | We demonstrated the | Our approach develops | In future work, we plan 

Argue for Practical | approach on data from a | a Bayesian model of the | to demonstrate our 

Significance in | previously published | data, then it applies | approach on larger case 

Software Engineering study comparing | cumulative prospect | studies, and to perform 


exploratory testing to 
test-case based testing 


theory on top of the 
model to incorporate a 
quantitative notion of 
utility and how 
probabilities are 
subjectively perceived 


more extensive 
validations of its 
usefulness in practice. 


by humans facing a 
decision 
A Procedure and | To the best of our | The aggregation | With the aim of easing 
Guidelines for | knowledge, no previous | techniques used to | the application of the 
Analyzing Groups of | attempts have been | analyze groups of | analysis procedure and 
Software Engineering | made in SE to provide | replications should be | with a view to 
Replications guidelines for analyzing | justified in research | reproducibility, the 
groups of SE | articles. This will | supplementary material 
replications increase the reliability 
and transparency of 
joint results. The 
proposed guidelines 
should ease this 
endeavor 
A Progression Model of | We carefully removed | Requirements Parts of a road-map that 
Software Engineering | any feature or function | documentation enables | concerns near future are 


Goals, Challenges, and 
Practices in Start-Ups 


not essential to testing 
growth and value 
hypotheses. Of which 
two, the value 
hypotheses is prioritized 


to create a plan, 
outlining what features 
to implement when, i.e., 
develop a product road- 
map. 


more detailed; however 
long-term plans are 
described at a higher, 
milestone level. 


A Theory of Value for 
Value-Based Feature 
Selection in Software 
Engineering 


Numerous software 
companies embrace 
today the paradigm shift 
led by the VBSE 
agenda, placing value at 
center stage in the SE 
landscape 


Understanding the 
nature of value for 
feature selection will 
help software 
companies, and their 
decision making teams, 
better comprehend 
value and provide them 
with a tool to improve 


Future work comprises 
extending the theory by 
integrating additional 
cases. 


their value-based 
decision making 
Bayesian Data Analysis | To help this process, | Bayesian analysis | We plan to come up 
in Empirical Software | Section 5 discussed | stresses precise | with a more 
Engineering Research some basic guidelines | modeling of | comprehensile set of 
that should be | assumptions, guidelines in future 
uncertainty, and domain | work. 
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applicable to a variety 
of common analyses. 


features, which help 
gain a deep 
understanding of the 
data; 


Evaluating an 
Educational Escape 
Room Conducted 


The results of the article 
provide strong and 
unquestionable 


These web applications 
were highly interactive 
and communicated with 


Therefore, an 
interesting future work 
opened by this research 


Remotely for Teaching | evidence that the remote | Escape in order to verify | is to compare an 
Software Engineering educational escape | puzzle solutions, | educational escape 
room conducted was a | synchronize its state | room conducted faceto- 
highly effective and | among teammates, and | face with the same 
very engaging activity | show real-time | escape room conducted 
for learning software | notifications remotely. 
modeling. 
Evaluation of Critical | Proposal for a | Our proposal takes a Future work, within the 
Thinking in Online | framework for the | generic approach, in | authors’ study 
Software Engineering | incorporation of critical | other words it could be | framework, will present 
Teaching: A Systematic | thinking in the context | applied in other | deeper analysis of the 
Mapping Study of software engineering | teaching contexts, not | proposal, followed later 
education online. only software | by a start-up version in 
engineering. an experimental 
initiative, to obtain 
indications of how the 
proposal can be adapted 
or improved. 
Gamifying Software | Applying self- | include evaluating the | Future work involves 
Engineering Tools to | determination theory to | Leader board during a | deploying the Leader 
Motivate Computer | these results suggests | single semester that was | board to additional 
Science Students to | that combining team- | disrupted midway | classes to ensure 
Start and Finish | based leaderboards with | through by COVID-19 | generalization of the 
Programming an automated unit | emergency remote | results, utilizing online 
Assignments Earlier testing system learning development 
environments with auto- 
grading 
Improvement of | One of the aspects that | The technological | In future courses in 
Learning Outcomes in | could have | ecosystem oof the | order to refine the 
Software Engineering: | complemented and | university, through the | measures implemented 
Active Methodologies | benefited this active | virtual campus and the | and extract a series of 


Supported Through the 


approach is the use of 


space in Google Drive, 


guides and tools so that 


Virtual Campus digital badges that allow | has allowed to | the experience can be 
to incorporate also | coordinate extrapolated to other 
gamification into the software engineering 
course. subjects. 

Improving Student | A qualitative | In any investigation, the | PBL include helping 

Engagement With | questionnaire divided | research strategy is a | students to develop 

Project-Based into three parts was | relevant decision since | flexible knowledge, 

Learning: A Case Study | designed. The first part | it communicates the | practical problem- 

in Software Engineering | consists of 16 ordinal | expected results of a | solving, learning and 


variables focusing on 


study and how these 


collaboration skills, and 
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the technical skills that 
students should develop 
in each CU 


results should be 


evaluated. 


intrinsic motivation. 
PBL, therefore, has the 
potential to prepare 
students more 


effectively for the future 


Improving Teamwork | In addition, further | The results of a study of | Future work will further 
in Agile Software | research will explore the | two groups of students | explore the validity of 
Engineering Education: | meditational role of | applying ASEST+ | the ASEST+ framework 
The ASEST+ | team cohesion between | indicated that their | and focus on new and 
Framework the antecedents and the | perceptions of team | larger samples 
outputs. cohesion, team 

performance, and team 

learning significantly 

increased compared 

with the perceptions of 

students in the groups 

that did not receive this 

intervention. 
A Novel Software | The results showed the | This manuscript | an important HI with 
Engineering Approach | delivery of the new | proposed a framework | ML topic in software 
Toward Using Machine | methodology for | that included a | engineering by 
Learning for Improving | oneshot delivery. For | theoretical framework | proposing an efficient 
the Efficiency of Health | the MAM component | composed of four | new method approach 


Systems on the SEMLHI | modules (software, ML | related to software 
framework model, ML algorithms, | engineering 
and HI data). 
Non Verbal | our work will contribute | We analyzed both | Future work to focus, 
Communication in | to improve the channels | verbal and non-verbal | more and more, on non- 
Software Engineering — | and tools used for | intersactions in software | verbal communication 
An Empirical Study communication and | development, exploiting | among software 
collaboration in | an approach inspired by | developers cooperating 
software development | the principles of | via Internet and 
distributed cognition. applications like Zoom 
or MS Teams. 
On Using Grey | However in the future | Therefore to utilize this | Possible future work for 


Literature and Google 
Scholar in Systematic 
Literature Reviews in 
Software Engineering 


SLR we intend to come 
up with a more detailed 
mechanism of 
categorizing conference 
proceedings as GL 


information in a proper 
manner, we suggest 
simple strategies to 
categorize GL based on 
various attributes 


the study is to bridge the 
gap between academia 
and the GL utilization 
process 


Software Engineering 
for JIoT-Driven Data 
Analytics Applications 


Data analytics systems 
and applications that 
ingest data from IoTs 
exploits sensors as 
interconnected things to 
collect, exchange, and 
process contextual data 


The proposed research 
complements 
community-wide 
initiatives on exploiting 
engineering practices, 
processes, patterns, 
frameworks, and tools 
for developing IoT- 
driven systems and 
software. 


Future research requires 
more case studies to be 
investigated. 
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questions on software 
team formation. 


Systematic | Mapping | We conclude from the | SDLC phase has been | We will develop the 
Study on Security | above discussion that it | most discussed and | SSAM model by using 
Approaches in Secure | is not good enough to | addressed, publication | the output of future 
Software Engineering secure the software | venue, and SWOT | RQs, supervisor inputs, 
system in post- | analysis. The results| and guidance from 
development phases and | indicate that this domain | existing studies 
there is a dire need to | is still immature and 
figure out better ways | sufficient research work 
and means to secure the | needs to be carried out 
software system. I particularly on 
empirically evaluated 
solutions. 
Team Formation in | we intend to investigate | Through this study, | Consequently, we 
Software Engineering: | further some of the | practitioners can | cannot demonstrate the 
A Systematic Mapping | research directions | improve their decision- | taxonomy’s utility since 
Study presented in this | making process while | it would not be 
research by elaborating | forming a software | appropriate to apply the 
on new research | team. same set used to build it. 


Therefore, we intend to 
demonstrate 
taxonomy’s utility in 
future work. 


The Impact of Lexicon 


Designed to verify the 


a subset of the well- 


the effort required to 


Adaptation on the/| relevance of using | known ANEW lexicon | adapt the lexicon should 
Emotion Mining From | dedicated lexicons | was developed, in | rather be directed at 
Software Engineering | during the sentiment | which 50 words were | improving the emotion 
Artifacts analysis or emotion | evaluated in the context | recognition algorithms 
mining with the IT | of software 
artifacts. development projects 
by IT professionals. 
Towards Connecting | However, the scale of | In our work, we provide | The results also provide 
Discrete Mathematics | the examples used in| four sample projects, | stimulus to 
and Software | these works is small, | the implementation of | continuously improve 
Engineering and cannot | which involves around | our method and to 
comprehensively train | one or two thousand | design more 
students in software | lines of code. comprehensive projects 
development. to tightly connect more 
courses in the near 
future. 
The Influence Of) Work allocating model | The most suitable | Future study will 
Lexicon Adaptation On | and appropriate | virtual machines for | contain making use of 
Emotion Mining Using | algorithms in this paper | processing tasks are | the approach to a actual 


Software Engineering | to achieve desired job | chosen from a pool of | cloud computing 
Artifacts allotment, execution, | previously created ones | machine to assess its 
and cloud computing | based on task difficulty. | overall performance and 
service quality. enhancing 
4.CONCLUSION 


Using original data sets gathered from a Palestine hospital over the course of the last three years, we 
proposed an effective new method approach related to software engineering that had been identified 
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in previous research studies and addressed an important HI with ML topic in software engineering. 
Software engineers and ML experts can collaborate on the ML model life-cycle, particularly in the 
health field, thanks to this methodology, which makes it possible for developers to analyze and 
develop software for the HI model. The theoretical framework for the framework that was proposed 
in this manuscript was made up of four modules: software, ML model, ML algorithms, and HI data. 
Three system engineering approaches were compared to the new method: SEMLHI, Vee, and Agile 
the outcomes demonstrated the implementation of the new one-time delivery method. Laboratory test 
results were obtained using five algorithms for the MAM component of the SEMLHI framework. 
These algorithms were used to test the accuracy of the ML models and the ICD-10 results using 
equations with a sample size of 750 patients. The SVG was about 0.57, according to the MAM results. 
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